
7 research outputs found

Predicting RNA secondary structure by the comparative approach: how to select the homologous sequences

Author: A Lescoute
AM Rosenblad
C Papanicolaou
C Woese
C Zwieb
C Zwieb
D Chiu
D Gautheret
D Mathews
D Matthews
D Sankoff
E Bindewald
F Rousset
F Tahi
F Tahi
Fariza Tahi
I Hofacker
J Brown
K Han
K Horimoto
L Vawter
M Szymanski
M Zuker
N Savill
O Perriquet
P Baldi
P Doty
P Higgs
PP Gardner
R Nussinov
RJ Klein
RR Gutell
S Freier
S Lindgreen
Stéfan Engelen
WC Curtis
Publication venue: BioMed Central
Publication date: 01/01/2007

Abstract. Background: The secondary structure of an RNA must be known before the relationship between its structure and function can be determined. One way to predict the secondary structure of an RNA is to identify covarying residues that maintain the pairings (Watson-Crick, Wobble and non-canonical pairings). This "comparative approach" consists of identifying mutations from homologous sequence alignments. The sequences must covary enough for compensatory mutations to be revealed, but comparison is difficult if they are too different. Thus the choice of homologous sequences is critical. While many possible combinations of homologous sequences may be used for prediction, only a few will give good structure predictions. This can be due to poor quality alignment in stems or to the variability of certain sequences. This problem of sequence selection is currently unsolved. Results: This paper describes an algorithm, SSCA, which measures the suitability of sequences for the comparative approach. It is based on evolutionary models with structure constraints, particularly those on sequence variations and stem alignment. We propose three models, based on different constraints on sequence alignments. We show the results of the SSCA algorithm for predicting the secondary structure of several RNAs. SSCA enabled us to choose sets of homologous sequences that gave better predictions than arbitrarily chosen sets of homologous sequences. Conclusion: SSCA is an algorithm for selecting combinations of RNA homologous sequences suitable for secondary structure predictions with the comparative approach.

HAL Evry

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Efficient pairwise RNA structure prediction and alignment using sequence alignment constraints

Author: AV Uzilov
B Gulko
B Knudsen
B Knudsen
B Morgenstern
D Sankoff
DH Mathews
DH Mathews
DH Mathews
DKY Chiu
DS Fields
E Rivas
G Storz
I Holmes
I Holmes
I Holmes
IL Hofacker
IL Hofacker
IL Hofacker
J Gorodkin
J Gorodkin
J Gorodkin
J Reeder
J Wuyts
J Wuyts
JE Hopcroft
JE Tabaska
JH Havgaard
M Zuker
M Zuker
MS Waterman
NR Pace
O Perriquet
PP Gardner
R Durbin
R Giegerich
R Green
R Lück
R Nussinov
RD Dowell
RD Dowell
Robin D Dowell
RR Gutell
RR Gutell
RR Gutell
S Batzoglou
S Griffiths-Jones
Sean R Eddy
SR Eddy
SV Muse
V Juan
VR Akmaev
Publication venue: BioMed Central
Publication date: 01/01/2006

BACKGROUND: We are interested in the problem of predicting secondary structure for small sets of homologous RNAs, by incorporating limited comparative sequence information into an RNA folding model. The Sankoff algorithm for simultaneous RNA folding and alignment is a basis for approaches to this problem. There are two open problems in applying a Sankoff algorithm: development of a good unified scoring system for alignment and folding and development of practical heuristics for dealing with the computational complexity of the algorithm. RESULTS: We use probabilistic models (pair stochastic context-free grammars, pairSCFGs) as a unifying framework for scoring pairwise alignment and folding. A constrained version of the pairSCFG structural alignment algorithm was developed which assumes knowledge of a few confidently aligned positions (pins). These pins are selected based on the posterior probabilities of a probabilistic pairwise sequence alignment. CONCLUSION: Pairwise RNA structural alignment improves on structure prediction accuracy relative to single sequence folding. Constraining on alignment is a straightforward method of reducing the runtime and memory requirements of the algorithm. Five practical implementations of the pairwise Sankoff algorithm – this work (Consan), David Mathews' Dynalign, Ian Holmes' Stemloc, Ivo Hofacker's PMcomp, and Jan Gorodkin's FOLDALIGN – have comparable overall performance with different strengths and weaknesses

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Digital Commons@Becker

Transat—A Method for Detecting the Conserved Helices of Functional RNA Structures, Including Transient, Pseudo-Knotted and Alternative Structures

Author: A Gultyaev
A Mironov
A Mironov
A Mironov
A Pyle
A Xayaphoummine
A Xayaphoummine
AP Gultyaev
AV Seliverstov
B Knudsen
B Knudsen
B Lewicki
B Onoa
B Tucker
B Voss
B Voss
C Flamm
C Flamm
C Heine
C Hyeon
C Notredame
C Witwer
C Yanofsky
C Yeang
D Herschlag
D Mathews
D Morse
D Repsilber
DH Mathews
DH Mathews
DH Mathews
E Alemán
E Nudler
EP Nawrocki
F Kramer
G Cochrane
H Clausen-Schaumann
H Isambert
H Touzet
I Hofacker
I Hofacker
I Holmes
I Holmes
IM Meyer
IM Meyer
Irmtraud M. Meyer
J Boyle
J Felsenstein
J Manuch
J Pedersen
J Pedersen
J Ruan
JH Havgaard
JHA Nagel
JS Mattick
K Gerdes
K Gerdes
K Gerdes
K Neugebauer
L Danilova
L Zhang
M Andronescu
M Geis
M Wolfinger
M Zuker
M Zuker
MY Chao
Nicholas J. P. Wiebe
O Perriquet
P Carninci
P Gollnick
P Stadler
PP Gardner
PP Gardner
PP Gardner
R Lee
RD Dowell
Ron Unger
RR Gutell
S Cao
S Griffiths-Jones
S Griffiths-Jones
S Harlepp
S Janssen
S Krishanthi
S Morgan
S Washietl
S Wuchty
SL Heilmann-Miller
SR Eddy
SR Eddy
T Ro-Choi
TN Wong
W Winkler
W Winkler
W Zhang
W Zhang
X Tang
X Tang
Y Ding
Y Ji
Publication venue: Public Library of Science
Publication date: 01/06/2010

The prediction of functional RNA structures has attracted increased interest, as it allows us to study the potential functional roles of many genes. RNA structure prediction methods, however, assume that there is a unique functional RNA structure and also do not predict functional features required for in vivo folding. In order to understand how functional RNA structures form in vivo, we require sophisticated experiments or reliable prediction methods. So far, there exist only a few experimentally validated transient RNA structures. On the computational side, there exist several computer programs which aim to predict the co-transcriptional folding pathway in vivo, but these make a range of simplifying assumptions and do not capture all features known to influence RNA folding in vivo. We want to investigate whether evolutionarily related RNA genes fold in a similar way in vivo. To this end, we have developed a new computational method, Transat, which detects conserved helices of high statistical significance. We introduce the method, present a comprehensive performance evaluation and show that Transat is able to predict the structural features of known reference structures, including pseudo-knotted ones, as well as those of known alternative structural configurations. Transat can also identify unstructured sub-sequences bound by other molecules and provides evidence for new helices which may define folding pathways, supporting the notion that homologous RNA sequences not only assume a similar reference RNA structure, but also fold similarly. Finally, we show that the structural features predicted by Transat differ from those assuming thermodynamic equilibrium. Unlike the existing methods for predicting folding pathways, our method works in a comparative way. This has the disadvantage of not being able to predict features as a function of time, but has the considerable advantage of highlighting conserved features and of not requiring a detailed knowledge of the cellular environment.

CiteSeerX

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

MDC Repository

Finding the common structure shared by two homologous RNAs

Author: H. Touzet
M. Dauchet
O. Perriquet
Publication venue: Oxford University Press (OUP)
Publication date

Crossref

Finding the common structure shared by two homologous RNAs

Author: H. Touzet
M. Dauchet
O. Perriquet
Publication venue
Publication date: 01/01/2003

Motivation: CARNAC is a new method for pairwise folding of RNA sequences. The program takes into account local similarity, stem energy, and covariations to produce the common folding. It can handle all RNA types, and has also been adapted to align a new homologous sequence along a reference structured sequence.

CiteSeerX

Two-step genetic programming for optimization of RNA common-structure

Author: B. Knudsen
B.-T. Zhang
E. Rivas
G.B. Fogel
J. Gorodkin
J.-H. Chen
J.R. Koza
L. Cai
M. Zuker
O. Perriquet
S.R. Eddy
T.J. Macke
Y. Sakakibara
Y. Sakakibara
Publication venue
Publication date: 01/01/2004

Abstract. We present an algorithm for identifying putative non-coding RNA (ncRNA) using an RCSG (RNA Common-Structural Grammar) and show the effectiveness of the algorithm. The algorithm consists of two steps: a structure learning step and a sequence learning step. Both steps are based on genetic programming. Generally, genetic programming has been applied to learning programs automatically, reconstructing networks, and predicting protein secondary structures. In this study, we use genetic programming to optimize structural grammars. The structural grammars can be formulated as rules of tree structure including function variables, so they can be learned by genetic programming. We have defined rules for encoding structure definition grammars into function trees. The performance of the algorithm is demonstrated by the results obtained from experiments with RCSGs of tRNA and 5S small RNA.

CiteSeerX

Crossref

Consensus folding of unaligned RNA sequences revisited

Author: A. Nahvi
A. Vitreschak
B. Knudsen
C. Lawrence
D. Bouthinon
D. Kampa
D. Mathews
D. Sankoff
G. Pavesi
G. Storz
H. Touzet
I. Hofacker
I. Hofacker
I. Tinoco
International Human Genome Sequencing Consortium
J. Gorodkin
J. Gorodkin
J. Jaeger
J. Thompson
M. Levitt
M. Waterman
M. Zuker
M. Zuker
N. Bray
O. Perriquet
R. Knight
R. Nussinov
R. Nussinov
S. Eddy
S. Eddy
S. Griffiths-Jones
T. Smith
V. Bafna
Y. Ji
Publication venue
Publication date: 01/01/2005

Abstract. One of the earliest problems in computational biology, RNA secondary structure prediction (sometimes referred to as “RNA folding”), has attracted attention again thanks to the recent discoveries of many novel non-coding RNA molecules. The two common approaches to this problem are de novo prediction of RNA secondary structure based on energy minimization and the “consensus folding” approach (computing the common secondary structure for a set of unaligned RNA sequences). Consensus folding algorithms work well when the correct seed alignment is part of the input to the problem. However, seed alignment itself is a challenging problem for diverged RNA families. In this paper, we propose a novel framework to predict the common secondary structure for unaligned RNA sequences. By matching putative stacks in RNA sequences, we make use of both primary sequence information and thermodynamic stability for prediction at the same time. We show that our method can predict the correct common RNA secondary structures even when we are given only a limited number of unaligned RNA sequences, and that it outperforms current algorithms in sensitivity and accuracy.

CiteSeerX

Crossref